# Continued Pretraining
## Gemma 2 Llama Swallow 27b It V0.1

Developer: tokyotech-llm · Large Language Model · Transformers · Multilingual

A Japanese-enhanced large language model based on the Gemma-2 architecture that significantly improves Japanese capabilities while retaining the base model's English proficiency.

## Taiwan Tinyllama V1.0 Chat

Developer: DavidLanz · License: Apache-2.0 · Large Language Model · Transformers · Chinese

A model built on the TinyLlama-1.1B architecture and adapted to Traditional Chinese through continued pretraining on approximately 2 billion tokens.

## Llama 3 Youko 8b

Developer: rinna · Large Language Model · Transformers · Multilingual

A Japanese-optimized model based on Meta-Llama-3-8B, obtained through continued pretraining on a mixed Japanese and English dataset of 22 billion tokens.

## Saul 7B Base

Developer: Equall · License: MIT · Large Language Model · Transformers · English

A large language model tailored to the legal domain, obtained through continued pretraining of Mistral-7B.

## Swallow MS 7b V0.1

Developer: tokyotech-llm · License: Apache-2.0 · Large Language Model · Transformers · Multilingual

Swallow-MS-7b-v0.1 is a Japanese-enhanced model obtained through continued pretraining of Mistral-7B-v0.1, developed by tokyotech-llm, with strong performance on Japanese tasks.

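All of the models above are distributed as standard Transformers checkpoints, so they can be loaded in the same way. Below is a minimal sketch assuming the Hugging Face repository id `tokyotech-llm/Swallow-MS-7b-v0.1`; the repository id and prompt are illustrative assumptions, so substitute the model you actually want to try.

```python
# Minimal sketch: loading one of the listed continued-pretraining checkpoints
# with the Hugging Face Transformers library. The repository id below is an
# assumption based on the listing above; swap in any of the other models.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "tokyotech-llm/Swallow-MS-7b-v0.1"  # assumed repo id from the listing

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

# Base (non-instruct) checkpoints are plain language models, so prompt them
# with text to continue rather than chat-formatted messages.
prompt = "東京工業大学の主なキャンパスは、"  # illustrative Japanese prompt
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64, do_sample=True, temperature=0.8)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```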